Exploring Highly Structure Similar Protein Sequence Motifs using SVD with Soft Granular Computing Models
نویسندگان
چکیده
Vital areas in Bioinformatics research is one of the Protein sequence analysis. Protein sequence motifs are determining the structure, function, and activities of the particular protein. The main objective of this paper is to obtain protein sequence motifs which are universally conserved across protein family boundaries. In this research, the input dataset is extremely large. Hence, an efficient technique is demanded. A Rough Granular computing model is created to efficiently extracting protein motif data that transcends protein families. Before apply this model, the very first step of this research is trying to reduce segments. The literature suggests that the Singular Value Decomposition (SVD) computing technique is more suited for reducing segments. After that the reduced segments are followed by applying Rough Granular computing model. The effectiveness of final results effectiveness is tested by several measures. The experimental results suggest that the SVD with Rough Granular computing model generates more number of highly structured motif patterns. KeywordsProtein Sequence Motifs, DBI, DI, HSSP-BLOSUM62, Granular Computing, K-Means, Adaptive Fuzzy C-Means, Rough K-Means.
منابع مشابه
Exploring Highly Structure Similar Protein Sequence Motifs using Granular Computing Model based on Adaptive FCM
Protein sequence motifs are very important to the analysis of biologically significant conserved regions to determine the conformation, function and activities of the proteins. These sequence motifs are identified from protein sequence segments generated from large number of protein sequences. All generated sequence segments may not yield potential motif patterns. In this paper, short recurring...
متن کاملSoft Granular Computing Model for Identifying Protein Sequence Motif Based on Svd-entropy Method
Bioinformatics is a field devoted to the interpretation and analysis of biological data using computational techniques. In recent years the study of bioinformatics has grown tremendously due to huge amount of biological information generated by scientific community. Proteins are made up of chain of amino acids. Protein sequence motifs are small fragments of conserved amino acids often associate...
متن کاملProtein Sequence Motif Detection using Novel Rough Granular Computing Model
Protein sequence motifs information is essential for the analysis of biologically significant regions. Discovering sequence motifs is a key task to realize the connection of sequences with their structures. Protein sequence motifs have the potential to determine the function and activities of the proteins. Many algorithms or techniques are used to determine motifs which require a predefined fix...
متن کاملNovel efficient granular computing models for protein sequence motifs and structure information discovery
Protein sequence motifs have the potential to determine the conformation, function and activities of the proteins. In order to obtain protein sequence motifs which are universally conserved across protein family boundaries, unlike most popular motif discovering algorithms, our input dataset is extremely large. As a result, an efficient technique is demanded. We create two granular computing mod...
متن کاملDiscovery and Extraction of Protein Sequence Motif Information that Transcends Protein Family Boundaries
Protein sequence motifs are gathering more and more attention in the field of sequence analysis. The recurring patterns have the potential to determine the conformation, function and activities of the proteins. In our work, we obtained protein sequence motifs which are universally conserved across protein family boundaries. Therefore, unlike most popular motif discovering algorithms, our input ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016